Moving Window Network Coding in Cooperative 

Multicast 

Fei Wu, Student Member, IEEE, Cunqing Hua, Member, IEEE, Hangguan Shan, Member, IEEE, Aiping 

Huang, Senior Member, IEEE 



(N 

o 

OO 



Abstract — Cooperative multicast is an effective solution to address the bottleneck problem of single-hop broadcast in wireless 
networks. By incorporating with the random linear network coding technique, the existing schemes can reduce the retransmission 
overhead significantly. However, the receivers may incur large decoding delay and complexity due to the batch decoding scheme. In 
addition, the dependency on the explicit feedback leads to scalability problem in larger networks. In this paper, a cooperative multicast 
protocol named MWNCast is proposed based on a novel moving window network coding technique. We prove three properties of the 
proposed scheme. Firstly, without explicit feedback, MWNCast can approach the cooperative capacity with the packet loss probability 
dropping almost exponentially with the increase of window size. Secondly, the average decoding delay of a receiver is on the order of 
0( pj^p ) with respect to its traffic intensity p. Thirdly, MWNCast can achieve the linear decoding complexity of 0(W) with respect 
to the window size W . Simulation results show that MWNCast outperforms the existing schemes by achieving better tradeoff between 
the throughput and decoding delay, meanwhile keeping the packet loss probability and decoding complexity at a very low level without 
explicit feedback. 
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1 Introduction 

Due to the broadcast nature of wireless channels, wire- 
less networks have been deemed as an efficient solu- 
tion for multicast file delivery, multimedia streaming 
services, etc. Under perfect channel conditions, multi- 
ple clients within the transmission range of a single 
transmitter node can receive the same piece of data 
simultaneously without incurring any extra overhead. 
However, this assumption is invalid in practice since 
wireless channels are subject to fast fading due to signal 
attenuation, shadowing and multipath effects, leading to 
random failure of packet reception at different clients. 

Although packet error can be tolerated to some extents 
in most multimedia streaming applications, excessive 
packet losses are unacceptable because it can lead to the 
degradation of quality of experience (QoE) to the end 
users. In order to improve the reliability of multicast, 
many techniques and protocols have been developed. 
One class of solutions follow the error recovery path that 
tries to tackle the packet loss problem using the auto- 
matic repeat request (ARQ) or combined with forward 
error correction (FEC) (e.g., |2l |3fl), which however lead 
to feedback storm problem since the source node relies 
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on the feedback from clients to make retransmission 
decisions. To address this issue, another class of schemes 
adopt the rateless coding strategy (e.g., J3-0])/ whereby 
the source node keeps transmitting coded symbols with- 
out explicit feedback, and any clients can decode the 
packet after accumulating enough symbols. Although 
such approaches are able to provide reliable transmis- 
sions, they may suffer from the bottleneck problem, that 
is, the throughput of the overall system is limited by the 
node with the worst channel capacity. 

As a natural solution to the bottleneck problem in 
multicast, cooperative communications have drawn in- 
creasing attentions recently. In |7], integrated with lay- 
ered video coding and packet level forward error cor- 
rection, the randomized distributed space time codes 
are adopted to design cooperative multicast scheme that 
can provide efficient and robust video delivery. Relay 
selection has been studied in [8] to improve the perfor- 
mance of cooperative multicast in a mobile computing 
environment. The outage probability with cooperative 
multicast is analyzed in [9[|, which suggests that the 
performance can be improved with more relay nodes. 
These schemes demonstrate the effectiveness of physical- 
layer cooperation in alleviating the bottleneck problem 
in multicast, but they may incur some difficulties in prac- 
tical implementation, such as tight time synchronization. 
Furthermore, the sequential retransmissions of the lost 
packets to multiple receivers (requested by feedback) can 
reduce the bandwidth efficiency. 

One potential way to address this issue is to utilize 
network coding techniques whereby the lost packets 
can be encoded together to reduce the number of re- 
transmissions. For example, 1 10] shows the benefit of 
cooperation at the network layer via a simple XOR 
network coding technique. In Hill , the random linear 
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network coding (RLNC) [6] is adopted for multicast 
applications, and the channel and power allocation in re- 
laying nodes are optimized for maximizing the multicast 
rate. It is shown in 1131 that compared to the physical- 
layer cooperation, the use of RLNC at the relays can 
enhance the system throughput. In II 311 . a RLNC-based 
opportunistic multicast protocol is proposed which can 
alleviate the bottleneck problem effectively. However, to 
avoid throughput degradation, the block size in RLNC 
has to scale with the the number of receivers 11411 , which 
in turn leads to large decoding delay and complexity. In 
addition, the centralized scheduling policies in JkHLzI] 
rely on the feedback from the relays and receivers about 
the packet reception status, which make them difficult 
to scale to larger network size in practice. 

In this paper, a cooperative multicast protocol named 
MWNCast is proposed based on the moving window 
network coding (MWNC) technique. By exploiting the 
residual capacity of relay nodes to serve the bandwidth 
starving receivers, the proposed scheme can effectively 
alleviate the bottleneck problem in wireless multicast. 
Based on the random walk and point process theory, 
we prove three fundamental properties of MWNCast. 
Firstly, without explicit feedback, MWNCast can ap- 
proach the cooperative capacity with the packet loss 
probability dropping almost exponentially with the in- 
crease of window size. Secondly, if the coding window is 
large enough such that the packet loss can be neglected 
, the average decoding delay experienced by a receiver 
is 0( (i-p)'-i )/ where p is the traffic intensity of the node. 
Moreover, the decoding delay of different receivers are 
mutually independent, which can guarantee the scalabil- 
ity of the scheme in large networks. Thirdly, MWNCast 
can achieve the minimal decoding complexity 0(W) (W 
is coding window size) for a given target throughput. 
We provide simulation results to validate the theoret- 
ical results, which show that the proposed scheme not 
only can guarantee reliable transmission without explicit 
feedback, but also can achieve high throughput with 
reduced decoding delay and complexity. 

The rest of this paper is organized as follows. In 
Section [2[ the system models assumed in this paper is 
introduced. We present MWNC in Section[3] In SectionlH 
an overview of MWNCast is firstly provided, followed 
by its functional modules in detail. In Section [5j we 
establish the theoretical framework and then prove three 
key properties of MWNCast. Simulation results are pro- 
vided in Section [6] and finally we conclude this paper in 
Section 

2 System Model 

We consider a wireless network consisting of a source 
node s and a set of J\f receivers. The source node has 
a stream of packets to be transmitted to all receivers. 
As discussed in previous section, for lossy wireless 
networks, the capacity of plain broadcast (even with 
a sophisticated network coding scheme) is limited by 



the worst receiver. To address this problem, we adopt 
a cooperative networking structure, whereby a subset 
TZ C J\f of nodes are selected as relays , which perform 
not only the normal receiving function to receive data 
from the source, but also the relaying function that for- 
wards the received data to the remaining subset £ of end 
receivers (£ = Af\lZ). To simplify the design of protocol, 
we assume that relay nodes only receive data from the 
source, while the end receivers can receive data from both 
the source and the relay nodes. 

Similar to [11], we assume that there are K orthogonal 
channels that can be operated by each node^. Therefore, 
in order to avoid co-channel transmission interference 
between the source and the relay nodes, at most K — 1 
relay nodes are allowed to transmit concurrently with 
the source. Time is divided into slots, and each node is 
equipped with one half-duplex radio, so a relay node 
cannot receive and relay at the same time. 

To characterize the lossy nature of the wireless chan- 
nel, let d.j denote the packet reception probability 
(PRP) for a pair of nodes i and j |15|| . In this paper, 
we assume the PRPs of all links in the network are 
quasi-static and collected by the source node through 
some online or offline measurements [16] [17]. Note that 
Ci j is equivalent to the capacity of link since it 

is the maximum achievable throughput for error-free 
transmission from node i to j. In the following, without 
abusing the notation, we refer to Cij the PRP as well as 
the capacity of the link. In particular, let Cqj denote the 
link capacity from the source to any node j e M. 

3 Moving Window Network Coding 

To improve the performance of wireless multicast, many 
different network coding techniques have been proposed 
from different perspectives. The random linear network 
coding scheme [6] adopts a block transmission strategy 
which can approach the capacity with less feedback 
overhead. Unfortunately, it is shown in [14] that the 
block size of RLNC has to scale with the increase of 
the number of receivers to avoid the loss of throughput, 
which however will result in large decoding delay. The 
ARQ-based online network coding (ANC) [18] achieves 
the one-hop maximum multicast throughput, but the 
decoding delay of the receivers with worse channel 
conditions is unfairly large. Many solutions have been 
proposed for this problem (e.g., I19l422ll . When the num- 
ber of receivers is small, the schemes proposed in | fl9|l 
and [20] can reduce the decoding delay, but the optimal 
throughput and decoding delay cannot be achieved si- 
multaneously for larger network size l23ll . In |2lll , a delay 
threshold based on scheme is proposed to incorporate 
with the ANC scheme, which can guarantee the decod- 
ing delay to be within the prescribed bound at the cost of 
throughput degradation. The instant decodable network 

1. We assume frequency division multiple access (FDMA) in this 
paper, but it can be easily generalized to time division multiple access 
(TDMA) too. 
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coding can effectively minimize the decoding delay, but 
it cannot guarantee the order of decoding [22]. Note that 
most of these delay control schemes rely on the feedback 
from receivers. With virtually no feedback information, 
the optimal RLNC strategy for delay-constrained traffic 
is studied in 12411 , but the scheme still suffers from 
the throughput degradation problem of RLNC with the 
network scale increase. 

Motivated by these techniques, we propose the 
MWNC scheme to combine the advantageous features of 
traditional network coding schemes[25]. MWNC adopts 
the encoding strategy similar to RLNC, but the block of 
packets to be encoded in each slot is moving forward at 
a constant speed V (see Fig.Q}. Specifically, at time slot t, 
a block of W packets with the sequence number ranging 
from [V • t] — W + 1 to \V ■ t\ are encoded with random 
coefficients on a finite field, which are also transmitted 
with the coded symbol@ After overhearing the coded 
symbols from the source, the receiver attempts to decode 
the original packets through Gauss-Jordan elimination 
approach. A typical example of the decoding process is 
shown in Fig. [2] in which the Gauss-Jordan Elimination 
can be performed progressively as the coded symbol 
arrives and finally the original packets can be retrieved 
when the reduced matrix has full rank (Fig. Efb)). Note 
that V represents the target throughput, so it should be 
within the network capacity. 

In Table [TJ we show an example where the window 
size W = 3 and the moving speed V = 0.5. In this ex- 
ample, the source starts by sending the uncoded packets 
Pi twice in the first two time slots, one of which is lost 
by the receiver. Then it sends coded symbols p% © pi 
with randomly chosen coefficients in the next two slots 
(since \Vt] = 2 for t = 3, 4), one of which gets received, 
so the receiver can successfully decode p2 at the forth 
time slot. From the fifth time slot, a full window of three 
packets are encoded in each time slot, which is moved 
forward with the speed of V = 0.5. Note that because 
there is no feedback mechanism in MWNC, it cannot 
guarantee 100% reliability. For example, p% will get lost 
at the 12 th time slot, since it will never be decoded after 
the window has moved to p$ , p^ , pi, even when the client 
has received the information of pz and p^ at the 8*' 1 time 
slot. However, we will prove in Section [5] that the packet 
loss probability with MWNC drops almost exponentially 
with the increase of window size. 

MWNC has some other interesting properties. Firstly, 
the decoding opportunity exists in each time slot, there- 
fore it avoids the intrinsic decoding delay problem in- 
curred by RLNC. In addition, the decoding opportunity 
is balanced between clients with good and poor channel 
conditions, so none of the clients will be dominated by 
other clients with better channel conditions. Secondly, 
the coding coefficient matrix in buffer is very sparse 
(see Fig. |2j due to the moving window strategy, so the 

2. [•] is the ceil function to guarantee that the boundaries of the 
window are aligned to integer values. Note that if \V ■ i\ < W, then 
the block is started from 1 to \V ■ t] . 
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Fig. 1. Encoding of MWNC. 
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Fig. 2. Decoding of MWNC. 



decoding complexity of MWNC is much lower than 
RLNC. In Section [5j we will develop some theoretical 
models to analyze these properties. 

Note that the concept of network coding over a mov- 
ing window has been considered in [26] and l27ll . In 
1 26], RLNC is incorporated with the congestion win- 
dow in TCP protocol to improve the throughput in the 
lossy wireless environment. In 1 27], SlideOR is proposed 
to encode packets in overlapping window, which can 
avoid the throughput loss in opportunistic routing. Our 
scheme differs from these schemes in the following 
aspects. Firstly, MWNC can achieve better control of 
the decoding delay and complexity with appropriate 
settings of the moving speed and window size, while 
these schemes are best-effort and there is no guarantee 
for the decoding delay at the receivers. In addition, we 
develop theoretical models to characterize the reliability, 
decoding delay and decoding complexity properties of 
MWNC. Secondly, these schemes rely on the feedback of 
the receivers to move forward the coding window, which 
is nontrivial in wireless broadcasting since the ACKs 
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TABLE 1 

Example of Moving Window Network Coding 

(W = 3,V = 0.5) 
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of different receivers have to be carefully scheduled to 
avoid collision. In addition, even if the reliability of 
ACKs can be guaranteed, the feedback delay may lead 
to the degradation of the network throughput[28]. In our 
scheme, the coding window is moved forward according 
to a prescribed moving speed V, which does not rely on 
the feedback from the receivers. Of course, V should be 
carefully set to be within the network capacity to avoid 
overwhelming the receivers, which is not difficult since 
the link capacity is assumed to be quasi-static. If the 
network is dynamic, this parameter should be adapted 
according to the network condition, which however is 
beyond the scope of this paper. 

4 Design of MWNCast 

In this section, we propose MWNCast, a cooperative 
multicast protocol based on the MWNC technique. Be- 
fore elaborating on the details of the protocol, we 
briefly introduce the motivation and basic functionality 
of MWNCast with a simple example. 

4.1 Overview of MWNCast 

Consider a simple example as shown in Fig. HJa), which 
consists of three receivers, the number on each link is 
the corresponding PRE For plain broadcast, it is easy 
to see the capacity of the system is 0.4 due to the 
bottleneck receiver R3, which requires more time to 
receive the same amount of information as that of clients 
i?i and i?2- Therefore, some time is wasted for clients 
Ri and R2 since the information sent by the source is 
not innovative to these two receivers after they have 
received the required data. On the other hand, if these 
two clients have packets that are not received by client 
R3, one of them can forward the packets to client R3 
on behalf of the source on a different channel using 
its residual time, while the other client can continue 
receiving data from the source. Ideally, if clients R\ , R2 
are assigned to devote 1/7 and 1/3 of their time to 
serve client R3 alternately while spending the rest of 
their time to receive from the BS, then the maximum 
achievable throughput for Ri is 0.7 x | = 0.6, and R 2 



is 0.9 x I = 0.6. Meanwhile, client R 3 can receive data 
alternately from clients RI, R2 when they are active, and 
from the BS in the rest time, so its achievable throughput 
is i x0.9+i x0.8+(l-±-±) x0.4 > 0.6 (see Fig.CJb)), 
which suggests that the throughput of 0.6 (packet /slot) 
can be achieved through this cooperation scheme. 

The key to the success of this cooperative strategy 
is the scheduling of the relay transmissions, that is, to 
determine which set of relay node should transmit at 
a specific time slot. To this end, we adopt a stochastic 
scheduling method, which works as follows. At the be- 
ginning of each time slot, the source generates a random 
variable x uniformly distributed in [0, 1]. If < x < y, 
then Ri is selected to relay the data to R3, while R2 
keeps receiving from the source. lij<x<j + ^, the 
roles of Ri and R 2 are exchanged. Otherwise, only the 
source transmits and all clients receive information from 
it. This scheduling decision is broadcasted to all relays. 
Each second-hop receiver always receives from the best 
transmitter (the source or a relay). An example of the 
scheduling sequences is shown in Fig. [3]Jc). 

The source and the selected relays will transmit at the 
scheduled time slot. The packets to be transmitted are 
encoded using the MWNC technique, which range from 
[V • i| — W + 1 to \V ■ t] at time t. The source transmits 
the encoded symbol on its channel, while the selected 
relay node transmits a specific encoded symbol on a 
different channel. For relay node, this encoded symbol 
is generated from a batch of packets (the most close to 
the expected window in the relay's buffer), including 
the newly decoded packets and the combination of 
undecoded packets (e.g., pio, pn, and pi 2 in Fig. E^d)). 
After overhearing the transmissions from the source and 
the relay, the end receiver attempt to decode the original 
packets with Gauss-Jordan elimination technique. 

4.2 MWNCast Protocol 

In this subsection, we discuss the details of MWNCast 
protocol. We have explained how to implement MWNC 
in a cooperative scenario, so in the following we focus on 
the cooperative scheduling in MWNCast, which can be 
decomposed into three modules, namely the selection of 
relay nodes, the allocation of relay time, and the online 
scheduling of relay transmissions. 

4.2. 1 Relay node selection 

The ultimate goal of MWNCast is to alleviate the bot- 
tleneck and maximize the multicast capacity through 
the cooperation of relays. To this end, we propose a 
procedure to search for a set of candidate relay nodes 
that can achieve the maximum capacity. The basic idea 
is as follows. For a given target network capacity Cy, 
we can partition the set of J\f receivers into two groups 
according to the PRPs from the source to these nodes, 
then the nodes with PRPs above Ct will be selected as 
the relay nodes since their residual capacities can be used 
for serving the remaining end receivers. 
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Algorithm 1: Relay node selection 
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Fig. 3. A simple example of MWNCast. 



The rational for selecting relay nodes in this way 
is that the selected relay nodes can meet the target 
capacity requirement. However, it cannot guarantee that 
the remaining nodes can achieve the target capacity 
Ct as well, since their achievable throughput depends 
on their link capacities to the relay nodes, as well as 
how much residual time of these relay nodes can be 
devoted for cooperation. Therefore, it is necessary to 
check the feasibility of this target Ct, which involves 
the computations of the available cooperation time for a 
given set of relays (to be discussed), with which we can 
compute the achievable throughput for each receivers. If 
any of the nodes fail to achieve the target capacity Ct, 
then it means that this target capacity is infeasible and 
a smaller value should be attempted, otherwise a larger 
capacity can be supported. 

Based on this idea, we propose a binary search pro- 
cedure to find the maximum achievable capacity as 
shown in Algorithm [1] The algorithm maintains a lower 
threshold Cl and an upper threshold Cjj for the target 
capacity initially. Then starting with Cr — (Cl + Cu)/2, 
a set of relay nodes with qualified link capacities are 
determined (lines 5-6). The achievable capacities of the 
remaining nodes are computed using Algorithm 2 (line 
7). If the target capacity can be achieved by all nodes, 
then the lower threshold is increased to Ct (line 9), 
otherwise the upper threshold is reduced to Ct (line 
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C L 0, Ce, <- 1; 
while Cu - Cl > A do 

C T <- (Cu - C L )/2; 
n^{j\C j>C T ,jeAf}; 
£ <r- Af\K; 

Call Algorithm 2 to check the feasibility of 
Ct for the relay set TZ and the receiver set £ ; 
if Ct is feasible then 
| Cl Ct; 
else 

| Cu <— Ct; 
end 
end 



14 end 



11). The same procedure is repeated until the upper 
and lower thresholds converge. Finally, the algorithm 
returns the set of qualified relay nodes for the maximum 
achievable capacity C T . 

4.2.2 Relay time allocation 

As discussed in the previous subsection, for a target 
capacity Ct, if a node i has a PRP of Co.i > Ct, it 
is selected as a candidate relay node. In this case, at 
least a Cr/Co,i fraction of its time has to be used for 
receiving data from the source so that the target capacity 
requirement can be satisfied. As a result, its residual 
time is at most (Co,i — Ct)/Co,i, which can be used for 
serving the remaining receiver nodes. Therefore, the next 
problem is to find the allocation of the relay time for each 
relay node under its residual time budget, such that the 
target capacity requirement of the end receiver nodes can 
also be satisfied. If such time allocation exists, it means 
the target capacity is achievable, and vice versa. 

To this end, we propose a relay time allocation al- 
gorithm as shown in Algorithm |2 which proceeds in 
round as follows. In the beginning, each candidate re- 
lay node i € TZ is initialized with the residual time 
C: = (Co,i — Ct) /Co^ (line 2), and each end receiver node 
j G £ has a residual capacity demand Dj = Ct (line 
3). In each round I, a greedy algorithm (algorithm 3) is 
invoked to select a subset TZi from TZ with at most K — 1 
elements, such that the overall capacity of all receivers 
in £ is maximized (line 7). The capacity of a node j 
is determined as Cr^j, whereby R(j) is the node in 
TZi U{ s l that provides the maximum capacity to node j 
among all nodes in TZi. 

Given the relay subset TZi, the next step is to decide 
the time ratio r; that they can devote for relaying. Note 
that since none of the relay nodes should contribute 
more than its residual time, and none of the receiver 
nodes should get service more than its residual capacity 
demand, so the rely time t; for this subset TZi is set to 
the minimum of the residual time of these relay nodes 
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Algorithm 2: Relay Time Allocation 
1 begin 

a 



9 
10 
11 
12 
13 
14 
15 
16 
17 
18 
19 



{C ,i-CT)/C ,i,VieK; 
Dj <- C T ,Vj e 5; 
Z «- ; 

while K ^ % and £ ^ % and Y,i<t>i < ^ do 
Z<-Z + l; 

Call Algorithm 3 to select % C such 
that | TZi | < A" and Yl,j£E is maximized, 

where <- argmax l£Kl u{s} Cjj; 
0; <- 

min{mm i6Ki Q, min je f Dj/C R ^ j, 1 
foreach z 6 7?./ do 
C s : «- Ci — (/>;; 
if Cj < then ft <- K\i; 
end 

foreach j e £ do 

if < then £ <- £\j; 
end 
end 

if £ == then return {ft, (j>i}i eC ; 
else return {0}; 



20 end 



and the residual demand of all receivers (line 8), then for 
each selected relay node i, its residual time is reduced 
by the amount of <f>i (line 10). If its residual time is used 
up, it is removed from the candidate relay set and will 
not participate in the relay time allocation in the next 
round (line 11). Similarly for each receiver j, its residual 
demand is reduced by an amount of <fii * Cr(j) j> which 
is the effective throughput it will receive from this set 
of relay nodes (line 14). If its demand is satisfied, it is 
removed from receiver set and will be considered in the 
next round (line 15). The same procedure is repeated 
to find the next subset of relay nodes and its relay 
time allocation, until either the candidate relay set ft or 
the receiver set £ becomes empty, or the overall relay 
time reaches 1 (line 8). If the receiver set ft is empty 
eventually, it means that the target capacity demand Ct 
can be met by all receiver nodes, then the algorithm 
returns a list of relay subsets and their corresponding 
relay time; Otherwise, it means the target capacity Ct is 
infeasible and the algorithm returns an empty set. 

In each round of Algorithm 2 (line 7), we need to 
find a subset of at most K — 1 relay nodes that can 
provide maximum capacity to the unsatisfied receivers 
together with the source node. Let B = lZ{J{s} denote 
the set nodes consisting of the candidate relay set ft and 
the source node s. The capacity of a selection of relay 
nodes ft; e B is defined as C(ft/) = J2je£ ^R(i),r ^ ur 
objective is to find a selection IZi with the maximum 
capacity such that the \1Zi \ < K. This problem is known 
as a special case of the generalized maximum coverage 
problem, which is NP-hard 12911 . To solve this problem, 



Algorithm 3: Greedy Maximum Capacity Relay Se- 
lection 

1 begin 

2 Ki <- {s}; 

3 R(j) ^s : Wj e£; 

4 while \Ki\ < K do 

5 Find a relay i g 1Z with the maximum 
residual capacity, i.e., 
i <- argmaxj/^ Y, je £ C k, 
if C ni (i,£) > then 
Ki^Ki® i; 
else break; 

end 

return TZi. 
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ii end 



we introduce the following definitions: 

Definition 1. (residual capacity/weight) Consider a selection 
IZi, a relay i and a receiver j. We define the residual capacity 



CnAhj) to be equal to C itj - C R(J ) d . 

Definition 2. (addition of a relay) For a selection TZi and a 
relay i £ TZi, we define IZi © i as the addition of i to Hi. In 
other words, IZi © i is a new selection TZ'i, and 



R'(j) 



i£C itj > Crcoj 



R(j), otherwise. 



(1) 



Base on these concepts, we develop a greedy algorithm 
as shown in Algorithm [3] The basic idea is to incremen- 
tally add the relay node with the maximum positive 
residual capacity, so that the overall capacity is non- 
decreasing. At line 3, all receivers are initially assigned 
to the source. Then in each round, one of the candidate 
relays that has the maximum positive residual capacity 
is selected to join the relay node set TZi (line 6) until \TZi\ 
exceeds K. It can be proved that this greedy algorithm 
can achieve an approximation ratio of 1 — (1 
to the optimal solution 12911 . 



K-l) 



4.2.3 Online Relay Transmission Scheduling 

From Algorithms 1 and 2, we can find a list C of 
relay node set IZi and the corresponding relay time 
allocation <fii, such that the multicast capacity of the 
system is maximized. Let C* denote the maximum ca- 
pacity corresponding to the results, then for any capacity 
requirement C < C* , we should have: 



C<Y,<ln*C R u )ij ,VjeM. 

lec 



(2) 



From @, we can see that the amount of time that 
a subset IZi to be scheduled for relaying should be 
proportional to <j>i, such that the required capacity can be 
satisfied. Since the time is slotted, as briefly introduced in 
last section, we can adopt a stochastic online algorithm 
to approximate the scheduling. Specifically, let us define 
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ipl as 

^ = x>>v/. (3) 

k<l 

In each time slot t, the source generates a random 
number between and 1, if its value falls between tpf- 
and V^fc+i/ then the fc th subset of relay nodes are selected 
for relaying in this time slot. It is easy to see that this 
stochastic scheduling policy converges to the required 
proportional of time for each relay set in a long run. 
This schedule algorithm can be executed by the source 
in an online fashion at the beginning of each time slot, 
and an unique channel is assigned to each selected 
relay node. The scheduling results (relay nodes and their 
operating channels) are broadcasted to all receivers, then 
they can choose the best relay node and switch to the 
corresponding channel to receive the data. 

5 Analysis 

In this section, we develop some theoretical models to 
characterize the basic properties of MWNCast. Firstly, 
we introduce the equivalent channel capacity model, which 
is an unified model for characterizing the capacity of 
both relay and receiver nodes. Based on this model, 
the decoding delay, reliability and decoding complexity 
properties of MWNCast are analyzed using the random 
walk and point process theories. 

5.1 Equivalent Channel Capacity Model 

As discussed in Section|2 the capacity of a point-to-point 
wireless link is given by the PRP Cij. However, the 
analysis of the link capacity in MWNCast is complicated 
since: (i) a relay node may not stay in the "receiving" 
state all the time; (ii) a receiver node may receive data 
from different relay nodes at different time slots. In 
this subsection, we propose an equivalent channel capacity 
model to characterize the capacity of these two kinds of 
nodes. 

For a relay node i £ TZ, its aggregated fraction of time 
in the "relaying" state is given by <f>j = J^iecie^ 4>i- 
Since the online relay scheduling algorithm is a stochas- 
tic scheme, we can assume that in each time slot, the 
probability for the node to receive from the source is 
1 — $i, and the probability for relaying is $i. Taking 
into account the PRP from the source, we can define 
the equivalent channel capacity Ci for this relay node 
as Ci = (1 — 3?i)Co,i, which is the maximum achievable 
throughput of this node from the source without errors. 

For a receiver node j, if a relay subset TZi is se- 
lected for transmission (with a probability of 4>i), it 
will choose to receive from the best relay node R(j) 
(including the source) with the maximum PRP, i.e., 
R(j) = argmaXj EKi y/ s \ Qj. Therefore, we can define 
the equivalent channel capacity Cj for node j as the 
aggregated throughput from all relay subsets, i.e., Cj = 



Note that this equivalent channel capacity model is 
an approximation of the link capacity for the two kinds 
of nodes in MWNCast, which makes it tractable to 
analyze the reliability and decoding delay properties in 
the following subsections. 

5.2 Preliminary Property of MWNC 

In this subsection, we establish some basic properties 
for MWNC using the random walk and point process 
theories, with which we can analyze the performance of 
MWNCast. 

In the following, we use the "packet" to denote the 
original data, and the "symbol" to denote the linear 
combination of the packets within the window. For any 
receiver with capacity C , let us define G(t) as the number 
of packets which are inevitably lost up to time t, I(t) as 
the total number of received innovative symbols up to 
t. Note that not all innovative symbols can be used for 
decoding the original packets. For example, in Table [TJ 
the symbol received at t = 8 is useless at the end of time 
t = 12 since p$ cannot be decoded ever since. We define 
D(t) as the number of discarded symbols up to time 
t. Then I(t) — D(t) represents the received innovative 
symbols that contain the information for the packets 
covered by the coding window up to time t (except for 
the lost G(t) packets). 

For a MWNC's receiver with capacity C, let us define 
a particle on TZ 1 with its position at time t given by 

S(t) = V xt-G(t)-(I(t)-D(t)). (4) 

We have the following results regarding the decoding 
and loss events for MWNC. 

Lemma 1. Decoding event occurs at time t if and only if 
S(t) < at the end of this time slot. All the packets from the 
last foremost decoded (or lost) packet to the head of current 
window will be decoded. 

Proof: Decoding event occurs at the moment when 
the coding coefficient matrix in buffer is full rank. In this 
case, the number of innovative symbols in buffer must 
be as many as the number of packets covered by the 
window by time t except for the lost ones, i.e., \V x t] — 
G(t). Therefore, we have (I(t) - D(t)) = \V X t] - G(t), 
i.e., S(t) < 0. □ 

Lemma 2. Packet loss event occurs at time t if and only if 
S(t) > W — V at the end of t. Moreover, all the un-decoded 
packets before the tail of the window are lost. 

Proof. The last packet in the coding window moves 
to \V X (t + 1) - W+l] at time t+l. So if the packet right 
before [Fx (t+l)] — W has not been "seen" (i.e., a symbol 
contains this packet has not been received before), then 
all the un-decoded packets before the window will get 
lost forever. In other words, \V X (t+l)] —W > G(t) + 
((I(t) - D{t)), which gets S(t) > W - V. □ 




Decode Lost 
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Fig. 4. Random walk model for MWNCast. 

Lemma 3. If a new packet is lost at time t, the number of 
newly discarded symbols in buffer must be exactly one less 
than the number of packets just get lost, i.e., 

AD(t) = AG(t) - 1. (5) 

Proof: Since the coding window moves constantly, if 
the packet \V x (i + 1)] - W - 1 has not been "seen" at 
time t, then the packet should have been decided as lost 
in the last time slot, which contradicts to the assumption. 

□ 

Accordingly, for the specified receiver, MWNC can be 
modeled as a one-dimensional random walks [30] (see 
Fig. 2). The random walk has two reflecting barriers at 
— V and W—V corresponding to the decoding and packet 
loss events respectively, where S(t) in ((4) corresponds to 
the position of the random walk at time t. Let X denote 
the step size of the random walk, which is a random 
variable with the following density function: 

f(x)=C6(x + d L ) + (l-C)6(x-d R ) (6) 

where di, = 1 — V and dp = V. Moreover, the mean 
and variance of a step are denoted as p = V — C and 
cr 2 = Var(X) = 6(1 - 6). 

The following theorem specifies the behavior of the 
random walk representing the specified receiver: 

Theorem 1. At time t, if the particle crosses the left barrier 
—V, it will be reflected rightward for a distance of dp in the 
next time slot. If the particle crosses the right barrier W — 
V, it will be immediately bounced back for a distance of 1. 
Otherwise, the particle will make a random move according 
to (©. 

Proof: Firstly, notice that if and only if S(t — 1) > — V, 
the received symbol at time t is innovative. That is, the 
window's foremost packet \V x t] is informative to the 
receiver since V(t-l)-G(t-l)-(I(t-l)-D(t-l)) > -V. 

Therefore, when the particle does not cross the two 
barriers, its position at time t relative to the last slot 
S(t) = S(t - 1) + V - AI(t) depends on whether a 
symbol is received successfully, which follows the the 
step function defined in ((6). If the particle just crosses 
the left barrier —V, the received symbol contains no 
new information. Thus I(t) = I(t — 1) and the particle 
moves rightward definitely. If the particle crosses the 
right barrier W — V, it is indicated in Lemma [3] that it 
will be bounced back instantly by a distance of 1. □ 

5.3 Reliability Analysis 

The reliability analysis is complicated in MWNCast since 
the second-hop receivers might suffer from larger packet 



Fig. 5. Point process model for decoding and loss events. 

loss ratio. However, it is easy to see that the proposed 
stochastic scheduling policy converges to the required 
proportional of time for each relay set in a long run 
and consequently the information difference between 
BS and the relays should not be large. In addition, the 
symbols to be transmitted by the relays are generated 
randomly, so that the probability that they contain inno- 
vative information to the second-hop receivers is greatly 
increased. Therefore, without differentiating the relays 
and receivers, we assume a specified client receives 
information on a channel of equivalent capacity C. Based 
on this assumption, the packet loss ratio for both the 
relays and the receivers can be derived. 

We can model MWNCast as a two-state point pro- 
cess [30] as shown in Fig. [5j which corresponds to 
the "Decode" (D) and "Loss" (L) events, respectively. 
Specifically, if the "Decode" event occurs at some time, 
with probability Pdd it will return the same state af- 
ter a random time interval Tdd, arid with probability 
Pdl = 1 — Pdd it will make a transition to the "Loss" 
states after a time interval Tdl- Similarly, we can define 
Pll,Pld and Tll,Tld as the transition probabilities 
and transition time for the "Loss" state. These quantities 
can be derived using the random walk and point process 
theories as follows. 

Firstly, let us define G(9) as the moment generating 
function of X, which is the two-sided Laplace transform 
of the step function f(x) defined in I0, that is, 

G(9) = E[e~ 6X ] = Ce- 6 ^ + (1 - C)e ev . (7) 

From the property of moment generating function, we 
know that: (i) G(9) is a convex function; (ii) If E[X] =/= 0, 
there are two roots for the equation G(9) = 1, one is 
9 = 0, the other is 9 = 9q who has the same sign as p. 

Let —B and A (A. B > 0) denote two absorbing 
barriers for the random walk starting at the origin, we 
can define the stopping time N as 

N = min{n: S{n) <-B or S(n)> A}, (8) 

which is the number of steps to cross one of the barriers 
starting from the origin. 

We can define the moment generating function Gn (9) 
with respect to N and S(N), that is, 

G N (6)=E[e- 8S Ws N ]. (9) 

Suppose that we set s = G(9)^ 1 , then we have 
G N (9) = E[e~ es ^G(9)- N }. For this equation, we can 
find 9 = #o such that G(9) = 1, then it is easy to verify 
that e~ es ( n >G(9)~ n is a martingale with mean 1 since it is 
the product of independent unit mean random variables. 
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According to the martingale stopping theorem (Theorem 
6.2.2 in 13111 ), we can obtain: 



E[e~ es W] = 1. 



(10) 



Since the events of S(N) < —B and S(N) > A are 
independent, from ([Sj and flOl , we have: 

E[e- es(N) \S(N) > A]P A + E[e- 0S<N) \S{N) < -B]P- B = 1. 

(11) 

For the absorbing states A and —B , we can get the 
following approximations: 



E[e- es(N) \S(N) > A] 



E[ e - 9S <- N ^\S(N) < —B] 



Substituting these two approximation equations into 
((Tol l, and using the fact that Pa + P-b = 1/ we can get 
the probabilities of absorption at A and -B as 



Pa 



P-i 



(12) 



where 6$ is the non-zero root of the equation G(9) = 1. 

To derive the distribution for iV, let \%(s) and A2(s) 
denote two real roots of the equation G(8) = 1/s. Then 
from 10, we can obtain a different expression of i fTOt with 
respect to N: 



E[e -X 1 (s)S(N) s N ] = 1; E[e -X 2 (s)S(N) s 



Ah 



(13) 



Using the approximation S(N) ~ A when S(N) > A, 
and S(N) ~ —B when S(N) < -B, we have: 



P A e 



E A {s N ) + P- B e^ a E_ B { S ") = l,t = 1,2, 



(14) 

where -Ea and denote the conditional expectations 
at A and — B, respectively. Using Pa and P-b given by 
(UJi, we can obtain E A (s N ) and E_ B {s N ) from CDO. Then 
we have the moment generating function for ./V as: 



E[s N ] = P A E A {s N ) + P- B E- B {s N ) 



(15) 



By differentiating Eq. JTBI l with respect to s, we can 
obtain the first and second moments of N respectively. 

To derive the packet loss ratio, we assume the par- 
ticle is always located at the largest possible position 
after an event, which gives the upper bound for the 
loss probability. By Theorem [TJ after an "D" event, the 
particle's maximum position is at da; After a "L" event, 
the particle must get back to at most W—l before making 
a random move. Therefore, the transition probabilities 
Pdd, Pdl, Pld and Pll between these two events, and 
the expected transition time Tdd,Tdl,Tld an d Tll can 
be derived using fT2] | and flBl respectively assuming the 
particle starting from the corresponding position. 

The equilibrium distribution of the embedded Markov 
chain for two-state point process can be given by 

kdPdlTdl+'k l P ll T ll +'k d P dd T dd +'k l P ld T ld , then 
the proportion of time passed from states "L" and "D" to 
state "L" are given by ir D P DL T DL /T and n L P LL T LL /T, 
respectively. 

Note that MWNC has two packet loss scenairo de- 
pending on the previous event, "L" to "L" and "D" to 



"L". If it is from "L" to "L", all the packets covered 
by the window will get lost, so the number of lost 
packets should be proportional to the transition time 
T LL . If it is from "D" to "L", then the lost packets 
should be the number of packets covered from the time 
of previous event minus W since only the packets behind 
the window will get lost. Therefore, the overall packet 
loss probability can be obtained as: 

ir D P DL {T DL - W/V) + ir L P LL T LL 

Ploss = j, ' U 6 ) 

5.4 Decoding delay analysis 

For a receiver in MWNCast, the delay for receiving a 
packet is composed of two parts: the queueing delay and 
the decoding delay. The queueing delay can be analyzed 
using the similar procedure in 11811 . In the following, 
we will focus on the decoding delay and consider a 
saturated system in which the source always has packets 
to transmitted. 

Attributed to the stochastic scheduling policy, any 
client (a relay or a receiver) in the network can be ap- 
proximately considered as connected to the information 
source by MWNC on a channel with equivalent capacity 
C. If the window size is sufficiently large, the packet loss 
probability is negligible. In this case, we can derive an 
upper bound for the average decoding delay assuming 
that W —> oo, whereby the random walk is simplified 
to a single left barrier at with a starting point dp. The 
barrier at indicates the moment of decoding. Assume 
the particle always reflects back to the largest possible 
position du after a decoding event. According to Eq. 1151 1, 
when A approaches to infinity and B is set dn, we have 
P-b = 1 and Pa = 0. The first two moments of N are 
derived from flBl as follows: 



E[N}~~^,E{N 2 } 



dn 2 [i 



(17) 



The decoding delay for a packet is defined as the time 
duration from the moment that it is encoded to the time 
that it is decoded. We can model the decoding process 
as a renewal process. Then the sum of decoding delay 
for the packet in a renewal period iVj is given by 

D(NA ~ ^ • NiV, 

where iVj/2 is the average decoding delay for a packet, 
NiV is the average number symbols transmitted during 
this time period. 

By the theory of renewal reward process and the 
definition of average decoding delay, we have 



D = lim 



1 



E{D) 



, D(Ni) = 
t^oot-V^ v ' E{N)-V 



1E\N 2 



2 E[N] 



(18) 



From i TTTl l and (TBI , we can prove that D approaches 
1° Q( (i_ p )a ) asymptotically, where p = V/C is the traffic 
intensity of the receiver. 
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5.5 Decoding Complexity 

In this part, we analyze the decoding complexity of 
MWNC assuming the window size W is sufficiently 
large such that the packet loss can be negligible. 

The decoding complexity of MWNC is composed of 
two parts: the forward elimination and the backward 
substitution. Suppose that a receiver has just decoded 
all packets up to \V x to] at time to, and t%, t%, ffc-i 
denote the time instances that the receiver receives a set 
of k — 1 encoded symbols but cannot decode them, until 
at time tk it receives the fcth symbol and is able to decode 
all the received symbols. In the following, we use Sj to 
denote the symbol received at time tj(j = 1, • ■ • , k). 

Lemma 4. The number of nonzero entries in the j th symbol 
sj after the forward elimination is \S(tj )~\ + 1. 

Proof: The forward elimination can assure that all 
the previously "seen" packets can be reduced from the 
newly received symbol. At time tj, the receiver has 
received I(tj) — D(tj) useful symbols. Hence, except 
for G(tj) inevitably lost ones, all the packets up to 
G(tj) + I(tj) — D(tj) have been "seen" even they may 
not be all decoded. The new symbol generated in the t/s 
coding window can be reduced at the corresponding po- 
sitions except for the last one G(tj)+I(tj) — D(tj), which 
becomes "seen" for the sake of the symbol received at 
tj. Hence, there are \V x tj] - G{tj) - (I(tj) - D{tj)) + 1 
nonzero entries are left, which equals to \S(tj)~\ accord- 
ing to Eq. (@). □ 

Lemma 5. The number of arithmetic operations required for 
forward elimination of the j th symbol Sj is W — \S(tj)] — 
J +_Spr 1 (^)l + 1) when j + \(S(tj)] < W, and 
Ei=]_ w +i +r( s fe )l(r 5 (*i)l + 1 ) otherwise. 

Proof: The packets from \tj x V] — W+l to \tj x V~\ are 
used to generate the specific symbol. Among them, the 
receiver may have decoded a number of packets. Since as 
Lemma [4] suggests, the number of nonzero entries after 
elimination is \(S(tj)~\ + 1 and there are j — 1 previously 
eliminated results, the receiver only has a decoded inter- 
section with the coding window if j + \(S(tj)~\ < W. In 
this case, the first W — [S(tj)~\ — j packets in the window 
are already decoded by the receiver, so the same number 
of calculations are needed to eliminate these entries. 
Then, the previously reduced symbols (sj,i < j) can be 
used to eliminate the corresponding entries, which takes 
[S^i)! + 1 operations for each symbol Si according to 
Lemma 31 When j + \{S(tj)~\ > W, there are no decoded 
packets in the coding window, thus the receiver can only 
use the existing eliminated results to reduce the symbol. 
The elimination for the W — \(S(tj)~\ — 1 positions takes 
Ei=jC^ + i +r(s( t j)1 (\S(U)~\ + 1) calculations. □ 

Theorem 2. Given the target throughput V, MWNC can 
achieve the optimal decoding complexity of 0(W). 

Proof: A network coding symbol is encoded with 
W packets, a receiver needs at least W — 1 operation 



to decode the original information, so a trivial lower 
bound for the decoding complexity is 0(W). Therefore, 
it is sufficient to prove that, for every receiver in the 
network, the decoding complexity of MWNC is upper 
bounded by 0(W). 

According to Lemma |5j an obvious upper bound of 
complexity for the j th symbol's elimination procedure 
is W + Y,Zr 1 (\ s ^)~\ + X S Thu s, the total number of 
computations for the forward elimination of k packets is 
upper bounded by + (\S(U)] +1))- The 

total number of computations for backward substitution 
is Ej=i( 1^(^)1 + 1)' so m e overall complexity fi(fc) for 
decoding k packets is bounded by: 

m < wk+ (±±nh + j2j2\s(u)] 

3 = 1 i=l (19) 

< (fc + 3)fc (fc + l)fc 
2 + 2 ' 

where the second inequality is valid because [S^i,)] < 
W. 

Note that as discussed in previous subsection, the 
decoding event occurs after a random time duration of 
N, so k = \V(t + N)] - \Vt] ~ VN. According to the 
renewal reward process and (19) . the average decoding 
complexity can be obtained as follows: 

E(n(N)) E{N*)V + 3E(N) E(N 2 )V + E(N) 
~ E{N)-V 2E(N) 2E(N) 

(20) 

From (17) , we know that given the throughput, E(N) 
and E(N 2 ) are independent of W , therefore, the decod- 
ing complexity is dominated by the window size, which 
is on the order of 0(W) from (|20) □ 

6 Simulation Results 

In this section, we provide extensive simulation results 
to compare the performance of MWNCast with other 
network coding-based multicast schemes, and also to 
illustrate the advantages of MWNC technique. The net- 
work topologies in simulations are generated randomly, 
whereby the location of all clients is uniformly dis- 
tributed around the source. The channel between any 
two nodes is assumed to follow the Rayleigh fading 
channel model. The coefficients of network coding are 
generated on a G(2 8 ) Galois Field. The time duration 
for each simulation is 10 5 time slots. 

6.1 Throughput and Decoding Delay 

In Fig. we compare the achievable throughput of 
MWNCasfH with those of RLNC and ANC under dif- 
ferent network sizes, where the window size of MWN- 
Cast is set to 20. When the channel number K = 1, 

3. For j = 1, the first symbol certainly needs no more than W oper- 
ations, thus without abusing the notation, we assume Y2i=i([S(ti)] + 
1) = 0. 

4. The packet loss probability is controlled under 10 3 as will be 
explained in the next subsection. 
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MWNCast reduces to MWNC (simple multicast without 
cooperation). We can see that the achieved throughput 
of MWNC is close to ANC (which is known to be 
throughput-optimal), but it outperforms RLNC under all 
network conditions. It also can be seen that the through- 
put of RLNC decreases as the network size increases, 
which has been discussed in the literature 11411 . When 
multiple channels are available, we provide the simula- 
tion for RLNC with the same relay scheduling strategy 
as in MWNCast for fair comparison. It is observed that 
for K = 2 and K = 3, MWNCast preserves its supe- 
riority over cooperative RLNC (denoted by CoopRLNC 
in Fig. [6} with the average performance gain of 21.5% 
and 22.5%, respectively. Therefore, we can see that by 
taking advantage of moving window network coding 
and cooperation, MWNCast is effective in improving the 
system throughput. 

In Fig. we study the tradeoff between the decoding 
delay and throughput for a network with 100 nodes. 
Note that the result of ANC is not included since the 
decoding delay of the receivers with poorer channel 
condition increases with the simulation time, which is 
unfair for comparison. In this figure, the lines and bars 
show the average and maximum decoding delays of all 
receivers respectively for each scheme. It is noteworthy 
that, to maintain a given throughput, with the newly 
proposed MWNC-based scheme, the average decoding 
delay for a successful decoded packet is much lower than 
that with the RLNC-based scheme. The reason is that the 
decoding opportunity with MWNC exists in each time 
slot, but with RLNC, it cannot decode until receiving the 
full block of packets. 

6.2 Reliability 

In Fig. [51 we study the effect of window size W on the 
packet loss ratios of MWNCast under different system 
traffic load conditions when the channel number K = 
2. The packet loss counted in the simulation can occur 
at any clients in the network. From the figure, it can 
be seen that the theoretical results match well with the 
simulation results. Also, we notice that, the packet loss 
probability drops almost exponentially with the increase 
of window size. Moreover, we can see the requirement 
of packet loss probability of 10~ 3 can be satisfied with 
W — 20 even when the traffic load is as high as 0.9. 

If a certain degree of packet loss can be tolerated 
(i.e. 10~ 3 ~ 10 _1 ), we study the relationship between 
the average decoding delay and packet loss probability 
by adjusting the window size. We consider a specific 
receiver with the packet erasure probability equals to 0.3. 
As shown in Fig. |9j for a given traffic load, the packet 
loss probability increases with the decrease of window 
size, but the average decoding delay for the packets not 
lost also gets smaller. As discussed in Lemma 13 this is 
because packet loss is inevitable when and only when 

5. System traffic load is defined as p = V/C* . 
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Fig. 6. Throughput vs. Network size. 

the the last unseen packet is just the one before the lat- 
termost packet in the window. Such packets, if not lost, 
will encumber the decoding of the following packets, 
leading to larger overall decoding delay. Therefore, with 
a smaller coding window size, MWNCast acts as a delay 
filter to force the receiver to drop the packets which may 
lead to the degradation of the overall delay performance. 

6.3 Decoding Complexity 

In Fig. [I0l we compare the decoding complexity of 
MWNC and RLNC. It can be seen that the decoding com- 
plexity of MWNC is much smaller than that of RLNC. It 
is found that to decode RLNC, the forward elimination 
is the dominating part (i.e., 0(W 3 )) as W increases and 
hence the average complexity for decoding an original 
packet is 0(W 2 ). Nevertheless, in order to achieve higher 
throughput, larger batch sizes have to be used, so the 
decoding complexity of RLNC increases dramatically 
with the growing of throughput. For MWNC, the de- 
coding coefficient matrix of the receiver is sparse since 
it has smaller number of non-zero items. As a result, the 
decoding complexity consumed by Gauss Elimination is 
significantly reduced (proportional to the window size) 
and has little dependency on the throughput. 

7 Conclusions 

In this paper, we proposed MWNC, a novel network 
coding scheme that has smaller decoding delay, lower 
complexity and no need for feedback from the receivers. 
Based on this technique, we further present the MWN- 
Cast protocol to address the bottleneck problem in wire- 
less multicast through cooperative relays. Theoretical 
analysis shows that the proposed schemes can approach 
the network capacity with the packet loss probability 
dropping almost exponentially with the increase of win- 
dow size, the average decoding delay is on the order of 
Q( (i-p)a ) an d the decoding complexity is on the order 
0(W). Simulation results are provided to validate the 
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Fig. 7. Decoding delay vs. Throughput. 



Jr 50 
1 

m 40 



- + - p=0.95 
•x- .p=0.9 
-□- p=0.85 



Packet loss probability 



Fig. 9. Decoding delay vs. Reliability. 
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Fig. 8. Packet loss probability vs. Window size W. 
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Fig. 10. Decoding complexity vs. Throughput. 



theoretical results and compare the performance of the 
proposed schemes with the existing solutions. 
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